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ABSTRACT 

This report introduces archivists to t'ra potential of 
expert systems for improving archives administration and alerts them 
to ways in which they can expect intelligent technologies to impact 
federal record-keeping systeius and scholarly research methods. The 
report introduces the topic by describing expert systems used in 
three Fortune 500 companies. It then defines expert systems, 
distinguishes them from conventional programs, and presents the 
capabilities of the technology together with examples of suitable 
applications. Discussion of the building of an expert system 
application begins with a short history of the svolution of the 
technology, followed by a detailed account of knowledge engineering, 
i.e., the process used to develop an expert system. Descriptions of 
several expert systems applications in the federal government 
highlight applications in the Internal Revenue Services, the Social 
Security Administration, and the Office of Management and Budget 
(Executive Office of the President). A report on the library 
profession's emerging use of this technology focuses largely on the 
three national libraries of the federal government: the National 
Library of Medicine, the National Agricultural Library, and the 
Library of Congress. The discussion of recent advances in expert 
systems technology that concludes the paper examines limitations of 
the technology, identifies likely frontiers for further research and 
development, and considers the implications of the technology for 
archives administration. A list of sources and related bibliographies 
is appended. (MAB) 
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This report, Expert Systems Technology and Its Implications for Archives, explores the 
potential application of expert Biystenas technology to archival programs and processes. It both 
introduces archivists to a new sofbware tool that may be used to improve ardiival operations, 
and alerts them to a technology successfiilly used in business, industry, government agencies, 
and increasingly by scholars. 

The report represents the cuhnination of eighteen months of research on artificial 
intelligence and expert systems. In preparing the report, members of the Archival Research 
and Evaluation Staff of the National Archives received briefings from professional staff of 
many cf the coimtry's most advanced artificial intelligence research laboratories. They also 
met with managers of operational expert systems applications, observed demonstrations of 
apphcations m use or underway, evaluated relevant literature, and participated in educational 
traming sponsored by professional associations. This final version of the report was reviewed 
by a team of technioEd experts. 

By releasing this report to the public, the National Archives invites an exchange of views with 
the archival community and related professions on the impUcations of the technology. We 
look forward to your comments on the report and the opportunity to explore the potential of 
expert flystems technology in archival administration. 




I? IN W.WILSON 

Archivist of the United States 
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PREFACE 



The National Archives and Records Administration (NARA) acquires, presei^ves, and makes 
available for use the permanently valuable records of the United States Government. Within 
NARA the Archival Research and Evaluation Staff evaluates the ways emerging technologies 
can be used to improve the management of archival records. The staff assesses technologies 
through the development of technical information reports and pilot/prototype projects, or 
through sample evaluations of new products or systems. 

This report introduces archivists to the potential of expert systems for improving archives 
administration. Further, it alerts archivists to ways in which they can expect intelligent 
technologies to impact federal record-keeping systems and scholarly research methods. 

The report introduces the topic by describing expert systems use in three Fortune 500 
companies. Next, it defines expert systems and distinguishes them from conventional 
programs. The capabihties of the technology are presented, with examples of suitable 
applications. 

Section two and three considers the building of an expert system application. It opens with 
a short history on the evolution of the technology, followed by a detsdlod account of knowledge 
engineering - the process used to develop an expert system. 

The fourth sect«.on reviews several expert systems applications in the federal government. 
The discussion of federal government appUcations features the Internal Revenue Service, the 
Social Security Administration, and the Office of Management and Budget (Executive Office 
of the President). A report on the library profession's emerging use of this technology 
appears next, focusing largely upon the three national libraries of the federal government - 
the National Library of Medicine, the National Agricultural Library, and the Library of 
Congress. 

A discussion of recent advances in expert systems technology is the focus of the final portion 
of the technical information paper. This section examines limitations of the technology and 
identifies likely frontiers for fiirther research and development. The technical information 
paper concludes by considering the impHcations of the technology for archives administration. 
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EXPERT SYSTEMS TECHNOLOGY AND ITS IMPLICATION FOR ARCHIVES 



1. Introduction 

In 1956, the Institute of Radio Engineers sponsored a symposium on the impact of computers 
on science and society.^ The symposium brought together many pioneers of computer 
technology to explore the technical, social, and political implications of their work. One 
speaker, a researcher from International Business Machines, Inc. (IBM), expressed optimism 
about the "first groping steps" the corporation was taking toward endowing machines with 
some sort of intelligence. He went on to comment on what he considered the real promise 
of computer technology: 

"The real importance of machines lies in their capacity as general logical 
devices. It's very easily understandable why, during these first ten years of 
computer technology, we have thought of them generally as calculating devices, 
but it is equally clear...that the really big problems, the problems which can 
greatly affect scheduling of production, with the planning and control of traffic 
in an airline,... or in general with the efficient conduct of any complex activity, 
will require on our part a rath different technique of machine use than we 
have yet developed....So far, \ ^ave all. .tended to think of computers as 
essentially mathematical compu.ijig devices... .It's a rather young attitude on 
our part.... In my opinion, this is only the beginning."'' 

The observations of this young researcher turned out to be prophetic. Although it took thirty 
years, corporations and federal agencies now use "smart" technologies to assist with activities 
that involve human reasoning, such as the scheduling, planning, and managing of complex 
tasks. During the past decade, the trend in computer technology toward smaller hardware, 
improved memory capacity, and decreased costs has encouraged the widespread automation 
of workplaces, and even private homes. As a result, to a great extent computers already 
handle much of the travail of the information age: the sorting and storing of data. To a 
surprising degree, computers now perform jobs once considered impenetrable human 
preserves: executing tasks and solving problems that involve logical reasoning using stored 
information. 

Expert systems are computer applications that can consider a vast amoimt of knowledge, 
then "reason" and recommend a course of action. No longer considered an emerging 
technology, expert systems have advanced from the research bench to operational maturity. 
During the past decade, most Fortune 500 companies introduced expert systems technology 
into corporate operations in an attempt to secure a competitive edge. IBM began 
commercializing the technology in 1985 and now has nearly 400 major expert systems 



"Symposium on the Impact of Computers on Science and Society," held at the 1956 National Convention 
of the Institute of Radio Engineers, Philadelphia (Waldorf-Astoria), March 22, 1956. 

Remarks of Dr. David Sayre, "Sympoaii <n on the impact of computers on science and society," IRE 
Transactions on Electronic Computers, Volume EC-5/3 (September 1956), p. 149. 

1 



7 



applications in use or under development, with six of the leading appHcations resulting in a 
net return in 1989 of $38.5 milHon. For example, one of the applications, CASES, functions 
as a support tool for staff who acquire and move equipment. It performs this task by 
identifying the particular forms that require completion and the signatures needed, and by 
offering precautionary advice and related information to the user. Since its introduction into 
operations in 1986, CASES has significantly reduced the number of routine questions posed 
to finance managers, in addition to reducing signature authorization problems and 
noncompliance with procedures. The system was developed in a year and a half at a cost of 
less than $100,000. 

E.I. DuPont integrated expert systems technology into its operations by internally developing 
many smaU appHcations. By 1989, the corporation reported 600 expert systems in use with 
an average annual return of 10 to 1 on each system. Since 1985, DuPont has used expert 
systems to assist in three types of tasks: 1) diagnosis - for example, determining why a 
product sold to a customer doesn't work, why a manufacturing yield is poor, or which part 
of a piece of equipment is responsible for a malfimction; 2) selection - identifying which of the 
600 widgets manufactured by the company is most appropriate for a particular customer, or 
the correct course of action when one of 200 chemicals leaks into the environment; and 3) 
planning/scheduling - reconciling contention for machinery while minimizing loss and 
production lapses within specified time constraints. Despite the high monetary ratio of 
return, DuPont reports that the greatest payback on its investment results from the 
replication of expertise - the freeing of experi^ on staff to perform more difficult jobs, by 
transferrmg task responsibility fi:om experts to non-experts. 

American Express Company (AE) offers a third example of corporate use of expert systems 
technology. They built a system that assists staff in authorizing tiie credit-worthiness of 
customers with marginal credit histories. In the actual work situation, AE staff are required 
to make each credit determination m less than 70 seconds. The expert system AE developed 
assists staff in this task by analyzing a customer's financial record to arrive at a credit 
approval recommendation for a particular purchase. The company estimates that the $4 
miJion investment in the system protects it from an annual $27 million loss fi^m bad credit 
and fraud and has reduced erroneous judgmer is on purchase approvals by about 75 percent 
Corporate managers report, however, that the system's greatest contribution is that it 
simultaneously peribnns the normally contradictory activities of improving customer service 
while reduang losses. Optimizing sensible credit approvals in marginal cases allows the 
company to maximize its profits. 

These three companies use expert systems, one of several intelligent technologies,^ to 
perform tasks normally thought to fall outside the range of machine capabilities. Business 
and industry are not the only enterprises to discover these technologies. Military and 
intelhgence agencies, as well as countless civilian government offices, use computers to solve 
real-world problems that involve human judgmente. In fact, forecasters anticipate that the 
near-term soaal impact of intelhgent technologies will compare to that of the industrial 
revolution. Over the course of a century, the industrial revolution irrevocably transformed 



Subfielda of artificial intelligence are considered "intelligent" technologies. See footnote four for further 
explanation. 



the entire spectrum of physical labor. Today, intelligent machines are revolutionizing the 
work of the mentally-slrilled: professional and semi-profebsional staff who perform with their 
minds. Expert systems, software technology that assists the human mind, are being used to 
enhance the productivity and skill of intellectual labor in much the same way that great 
machines first assisted human muscles in the last century. 

What are expert systems? 

Expert systems are computer tools that help humans complete a task, solve a problem, or 
midce a decision that requires logical reasoning. As a subfield of artificial intelligenc* = {AD* 
the key characteristic of expert systems Ues in its use of judgments (or human-like reasoning) 
instead of mathematical calculations (or algorithms) to execute procedures. This fe*'' ire 
distinguishes expert systems firom conventional programming. They formulate solutioxid by 
using factual knowledge as well as heuristics, "rules of thumb" that establish the basis for 
an educated guess. Unlike algorithms, a heuristic approach to problem solving entails 
judgments and often a degree of uncertainty. The ability to make machines perform with 
human-like reasoning and uncertain logic represents the chief contribution of expert systems 
technology to computer science. 

Expert systems function as assistants rather than replacements. Most are designed to handle 
about 85 percent of the cases for a particular domain. They can be developed as stand-alone 
applications, as embedded code in conventional applications, or as systems fully integrated 
in operational environmente. 

In a typical apphcation, the system first requests initial information about a problem. It then 
searches the knowledge stored in the computer for a rule, pattern, or model that "matehes" 
characteristics of the problem suggested by the initial data. The application may offer an 
immediate solution or request more information. This process continues until the system 
arrives at a conclusion or recommends an action. Ideally, such systems can explain to users 
at any time the line of questioning, the reasoning behind a particular conclusion, or the 
relevance of a suggested course of action. 

The effectiveness of expert systems performance has been established with a variety of 
intellectual tasks. Some of the most productive systems have been developed for tasks such 
as 1) scheduling/planning - applications that integrate actions, events, and objects within 
specific time and resource constrainte; 2) diagnosis - applications that identify diseases, 
organic substances, machinery problems, and so forth, based on known information; 



Artificial intelligence is a subfield of computer science concerned with developing machines capable of 
performing functions normally associated with human intelligence, such as reasoning, learning, and 
understanding human language. Some of the mig'or application areas within the field are robotics, machine 
vision, machine voice recognition, natural language processing, expert systems, and neural networks. The 
pursuit of machines capable of performing tasks that require "intelligence" has been a source of controversy . 
Opponents argue that human intelligence can never be reduced to machine processing, hence the goal of 
producing "intelligent" machines is not realizable. This paper does not engage the question of whether or 
not AI applications possess reasoning capabilities equivalent to human intelligence because the issue is 
irrelevant for federal program managers. As government staff, the only important issue in considering AI 
technology is whether or not it works •• does it solve an operational problem? 



3) computer-aided learning - applications that use computers to transfer expertise from senior 
to jumor staff for training purposes; 4) selection - applications that classify objects and 
recommend a course of action appropriate to the classification; 5) data interpretation - 
applications that infer patterns and anomaUes, generate likely outcomes, and discover new 
information with large, complex aggregates of data; and 6) user interfaces - applications that 
support query negotiation and information retrieval in an automated environment. 

An impressive rarrrre of appUcations ah^ady has been developed. The financial community 
was one of the eai '^ie^t professions to adopt expert systems, implementing the technology with 
portfolio manage^^oxii, with risk analysis, and to evaluate m^or strategic investments 
Manufacturers use tfie technology to control production processes, diagnose system 
malfunctions, and design complex machinery. Banks and insurance companies use expert 
systems for asset management, credit analysis, medical assessments, and overall 
management functions. Expert systems also provide on-site managerial assistance to retail 
chain managers. 

The scientific professions have developed expert syste.ns for many purposes, including 
diagnosing diseases, providing advice on .cientific experiments, and helping unskilled 
metallurgy staff identify commercially used compounds. In the transportation industry 
expert systems control the handling, labeling, and shipping of hazardous chemicals. 
Automobile manufacturers, such as Toyota, use the technology to diagnose engine problems 
m passenger cars and to customize vehicle design. Expert systems assist cartographers in 
compiling features on nautical maps and charts. 

ExMnples of expert systems are almost as numerous as varieties of professional and semi- 
a!^wf^°? 7 f'^^^^y^!^'^^ are used extensively to provide legal advice, to support 
geologica^ analysis for marketmg purposes, for the analysis of actuarial data, to make social 
service ehgibihty determinations, for mihtary battie management, for evaluation of loan 
dStiw ^ "^^^^"^^ ^^'^""^ knowledge of rural herbal 

Altiiough at last count more than 2200 operational applications existed in the business worid, 
and the estimates for the number of government applications are even greater,^ archivists 
nmh p1 T xf^''^ feasibiUty of using the technology to solve mission 

problems. In 1985, the National Archives and Records Administration (NARA) developed a 
proof-of-concept prototype expert system that emulates the reasoning strategies of reference 



Re lable figures on the extent of operationalized expert systems are unavailable. Expert System Strategies 
H^l !5wl3''r^ '^^V" 2200 business applications in the U S. EdTt^/^u 

acquinng the technology that this number would double in 1990. Feigenbaum, et al reported 1500 

^v^iT:'^ V?'""J" '^t^-^- °f includes gov;^me^^ eS 

7e^Z t^^X rT T'""" Department's Industrial 0«2*S 

reports that the federal govermnent accounts for 75 percent of the U.S. expert systems marVet Although 
the federal government does not track use of the technology, it is commonly recoX^ tha Zst 

Sf numC "^rr '"'"'"^ ^r'"^^"" the mihtary. or the spac^^rCam. However 

the number of civilian government applications is rapidly increasing. nowever. 
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archivists in negotiating patron requests. Although the prototype successfully demonstrated 
proof of concept, NARA did not pursue further research or development at that time 

The only other archival appHcation of the technology is found in a system that helps 
archivists identify objectives for strategic planning in preservation management. The 
National Association of Government Archives and Records Administrators (NAGARA) 
sponsored the development of a stand-alone application, which includes approximately 320 
production rules developed using EXSYS software. The system is currently out for pilot 
testing.' A related research project that will be carried out during the next four years in 
conjunction with the Swiss Federal Archives investigates the use of natural language 
processing to support automatic fiUng of documents in an electronic environment.* Natural 
language processing, another subfield of artificial intelligence, involves the development of 
machines capable of understanding language as people typically speak and write it. 

New and emerging technologies promise to modernize the performance of archival 
administration tasks and impact the conduct of standard archival practice. This report 
introduces archivists to the ways intelligent fc^chnologies, and in particular expert systems, 
can be used to improve the management of archives. The studies conducted with operational 
expert systems reported in this paper clearly demonstrate that the technology improves 
speed, productivity, and consistency when used with many professional and semi-professional 
tasks. Further, widespread use of the technology by federal agencies, and fiiture use by 
scholarly researchers, may make current aspects of archival practice obsolete. For example, 
most information systems developed by the federal government do not include a complete 
record of the rules and regulations that comprise an agency's policy. In the future, however, 
an agency's poUcy likely will appear increasingly as code in intelligent programs. This trend 
suggests the need to reassess conventional appraisal practices. Moreover, forecasters predict 
that once widely available, scholars will adopt "smart" software to inform their analyses. For 
these reasons archivists need to understand inteUigent technologies, begin to integrate them 
into appropriate operations, and consider the operational implications of their use by federal 
agencies and scholars. 



The goal of the project was to apply the findings of the reference archivist o o:-:' jt-concept expert system 
to the design of an automated information system under developmant for the OfTioe of the National 
Archives According to NARA's Five-Year ADP Plan, an expert system front-end interface for this system 
18 scheduled for development around 1994. For further information on the proof-of<oncept system see- 
Renee M. Jaussaud. "The archivist's assistant: from the expert's perspective," in Managing Artificial 
InteUigence ar^ Expert Systems; Daniel A. De Salvo et a1 , "Structured Design of an Expert System 
tTT^.'^ ^L?^ National Archives," in Expert S^mems for Business; and Daniel A De Salvo and Jay 
Uebowitz. The application of an expert system for information retrieval at the National Archives," 
lelematws and Informatics. (For the full citations, see section 6 of the bibliography). 

The NAGARA expert system was developed by Bonnie Curtin. Georgia Department of Aixhives and History 
as one part of a three-part ai chival preservation planning instrument. 

Funding for the project is provided by Fonds national suisse de la recherrhe scUntirujue. Christoph Graf is 
the project director at the Swiss Federal Archives. The principal AI investigator is Margaret King. 
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11. Building Expert Systems : History 

For more than 30 years, artificial intelligence researchers have explored human cognition in 
an effort to develop machines capable of emulating human thinking. The earliest attempt 
to develop an intelligent program resulted in the General Problem Solver (GPS) 
Investigators built the GPS by extracting general strategies for problem solving bom 
mterviews in which people were asked to solve problems out loud. This research typified the 
A' community's emphasis during the first decade of research on discovering generic human 
reasoning strategies. After many years of trial applications, a new tenet, known as the 
knowledge principle, emerged as the fundamental principle for creating machine 
intelligence." The knowledge principle suggests that the quality and quantity of knowledge 
coded in a program is the key to achieving high levels of machine competence with 
complicated intellectual problems. Reasoning strategies, although necessary, have secondary 
importance for machine problem solving. Expert systems emerged in the 1960'8 and 1970'8 
as the practical application of the knowledge principle. By the end of the 1980's, expert 
systems technology achieved operational maturity. 

One of the earliest successful systems, Dendral, assists chemists to determine molecular 
structures by analyzing spectrograph data. Edward Feigenbaum and his colleagues at 
Stanford University spent more than 15 work-years developing Dendral. Today it is found 
in organic chemistry laboratories throughout the worid, and its use has resulted in more than 
50 publications in the chemistry literature. 

During this same period, Massachusetts Institute of Technology (MIT) researchers created 
Macsyma, an application that tackles mathematical problems including calculus, algebra, and 
the solving of equations. After nearly 100 work-years of development, Macsyma performs 
better than most human experts in the symbolic conduct of differential and integral calculus. 

Mycin, developed at Stanford in 1972, diagnoses bacterial blood infections and recommends 
medical treatment. Emycin, an expert system software development tool, evolved fit)m Mycin 
around 1978. It was named Emycin (Empty Mycin) because it contains the logical structure 
of Mycin without its specific knowledge of blood disorders. Emycin marks the development 
of the first expert system shell - a program with logical structures and inferencing strategies, 
but without the specific knowledge required for a particular application. 

Also during the 1970's, the Stanford Research Institute developed Prospector, an appUcation 
designed to advise on mineral prospects of a region by producing favorability maps. Funded 
by the U.S. Geological Survey (USGS) and the National Science Foundation, Prospector 
provided consultation to geologists in the eariy stages of identifying probable ore-grade- 
deposit sites. Since 1983, the VSGS's successor system, Prospector 11, has evolved into a fiUl- 
sized system to aid geologists to identify regional mineral deposit types." 



For further information on Prospector II see: Richard B. McCammon, "Maintaining Prospector II aa a futl- 
sized knowledge based system." AISIG '90 Research Workshop: Working Papers on Full-Sue KnowUdge 
Based Systems. (May 7-8, 1990), pp. 1/14 -1/32. 
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The first commercial operational expert system, Xcon, was developed by Digital Equipment 
Corporation (DEC) in 1980. Xcon configures the company's response to customers* requests 
for VAX computer systems by reconciling space requirements, client specifications, and 
technical constraints. DEC estimates that the application saves the corporation 
approximately $200,000 per month in staff costs, as well as an uncalculated amount in 
manufacturing costs. 

These early expert systems constitute only a firaction of the thirty-year research effort 
expended to develop an operational technology. Although the systems span a variety of 
domains, a common lesson emerged {rom the development efforts. That lesson - the 
nnramnnnt impo.-tan ce of knowledge - formed the basis for all subsequent achievements in 
today's expert systems technology. 



in. Building Expert Systems: Knowledge Engineering 

Expert systems are designed and built by knowledge engineers, computer scientists trained 
in applied artificial intelligence methods. Knowledge engineers develop expert systems in 
dose coordination with domain experts, individuals who by training, education, or experience 
have demonstrated proficiency within a particular field. The chief quality that distinguishes 
domain experts firom others in a crafl is expertise - the capacity to efficiently and effectively 
solve discipline-specific problems. 

Knowledge engineers use the expertise acquired from experts to develop systems in a 
particular domain. Domains are fields of knowledge in which humans use intelligent 
judgments to reach a decision or complete a task. Archives administration could be 
considered a broad domain, while records appraisal represents a narrower domain. Today's 
expert systems perform most effectively with narrow, well-bounded problems that involve 
judgments but not a lot of what is considered "common sense" to solve. 

Expert systems typically consist of a knowledge base, an inference engine, an explanation 
faciUty, and a user interface. The knowledge base acts as the system's repository for facts 
and heuristic rules used by the expert to solve domain problems. Factual rules represent 
laws, regulations, and knowledge commonly agreed upon by experienced members of a field. 
In contrast, heuristic rules constitute judgmental, experimental, and often uncertain personal 
knowledge comprised of rules of thumb that form the basis for educated guesses. These 
procedural tips and "tricks of the trade" help to guide, hmit, and speed up the search process. 
Most individuals invoke widely known hexiristic rules when selecting wine to complement a 
meal; e.g., if the meal is chicken or fish, then select a white wine. This type of heuristic 
knowledge narrows the search, resulting in more efficient decision-making. 

Unlike conventional programs, expert systems are characterized by an absolute distinction 
between knowledge and processing controls. Since the rules and facts found in expert 
systems reside in static knowledge bases, another mechanism must control where to start the 
reasoning process and which rule to examine next. For this reason, expert systems contain 
inference engines, the drivers of the rules that perform the reasoning function. The inference 
engine controls the process of invoking the rules necessary for solving a particular problem. 
Inference engines perform this function through a search strategy. 

7 
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Many different search strategies are used with expert systems, but most are based on one of 
two fundamental concepts: forward chaining or backward chaining. Forward chaining is a 
bottom-up reasoning process that starts with known v'onditions and works towards a desired 
goal. Backward chaining is a top-down reasoning process that starts from a desired goal and 
works back to determine if the requisite supporting conditions exist. A machine asked to 
prove a theorem would use backward chaining, while determining the next move in a chess 
game involves a forward-chaining search. 

Explanation facilities, another critical feature of expert systems, are not found in 
conventional computer programs. Unlike eKpert systems, conventional programs typically 
process data by repeatedly running the same predefined procedures that provide 100 percent 
certain results. For example, automated banking systems arrive at a daily balance by 
processing each day's deposits and withdrawals through the same set of computer procedures. 
Even though the set of numbers changes daily, the same computer procedures are used to 
amve at a daily balance, eliminating the need for an explanation facility. 

Expert systems, however, reach their rev.ommendations by using uncertain and probabilistic 
logic ^vith a unique series of rules. Moreover, the system's recommendations represent 
satisfactory solutions, not absolute answers as with conventional applications. Assuming 
more than one acceptable answer is possible, expert systems are programmed to justify theii- 
recommendations by supplying the rationale. It is this explanation facility that describes 
how decisions are reached and the steps taken to reach them. Its purpose is to increase user 
confidence in the system by supplying the information needed to understand and assess the 
system s logic. 

The final standard component of expert systems is the user inverface. As is customary with 
conventional programs, the interface facilitates the interaction of the computer program u-ith 
the end user. Interfaces typically translate user communications into a form that can be 
processed by the rest of the system. They also convert the machine's processing into a form 
a user can understand. The interface for an expert system is comparable to those found on 
word processing software or bibliographic systems - it provides the user with a m-^ans to 
carry on a dialog with the system (see Figure 1). 

Ejqpert systems software can be developed either with special artificial intelligence hardware 
and programming languages, or on standard computers with either a commercial expert 
system software shell or in a standard software programming language such af "C\ 
CompuUir scientists have developed two AI programming languages: LISF (List Processing), 
the language of choice in the United States, and PROLOG (Programming Logic), most 
popul^ m Japan and Europe. A LISP machine, special symbolic processing hardware, often 
IS used with LISP to optimize programming performance. Although the programming 
environment with the LISP machine includes many desirable features currently unavailable 
with standard computers, the clear trend is toward the diminishing use of symbolic 
processors in favor of standard hardware.^® 
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Symbolic proceBSora manipulate symbols that represent primarily logical rather than numerical operators. 
Today's manufacturers of symbolic proct>,8ors are Symbolics, Inc. and Texas Instruments, Inc. 



8 



14 



Expert System Architecture 
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Figure 1 (Adapted from Paul Harmon and David King, Expert Systems, N. Y. : John WUey 
& Sons, Inc., 1985, p. 34) 

In today's market, most expert systems are developed with some sort of commercial sheU 
Expert system software sheUs are sometimes compared to prefabricated houses; they are a 
set of tools and templates for building an expert system that allow for some customization 
ot the product. Off-the-shelf software tools provide skeleton expert systems that typicaUy 
include a predefined inference engine with empty buckets for developing a customized 
knowledge base. They may also contain a validator program that alerts the knowledge 
engmeer to wntradictory facts and rules, or redundant and missing information, mechanisms 
lor editmg the knowledge base, correcting syntactical errors, or debugging the system during 
development, as well as spelUng checkers, windowing packages, and integration procedures 
for use with conventional software. 

Software shells characteristically reduce development time, but they necessarily limit 
development options. Typically, they support only one or two methods of inferencing and 
knowledge representation. When using software shells, knowledge engineers must accept 
either the hmitations of a particular shell or expend considerable effort to program around 
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them. For this reason, many knowledge engineers prefer to build a system from scratch on 
symbohc processors with an AI language, then convert it to conventional hardware and 
software for use. This is particularly true for large, complex systems or high-level AI 
research applications. Software sheUs tend to be used more frequently by in-house 
developers and commercial vendors to build relatively straight-forward small to mid-range 
applications. ^ 

Wheth^ starting from scratch or using a si jiowledge engineers translate the expertise 
of experts into a language that a machine - lanipulate. System development begins with 
the knowledge engineer eliciting from aj . t or group of experts, the knowledge used to 
solve problems m a domain. The process v . acting, structuring, and organizing knowledge 
acquired fix)m human experts is one of the most difficult, yet critical, aspects of expert 
systems development. Because expert systems emulate human intelligence, the qualitVof 
the knowledge -ported by domain experts direcUy affects the caliber of the system's 
perlormance. Artifiaal inteUigence researchers draw on cognitive psychology to explain the 
variety of problems encountered in eliciting domain knowledge from experts. 

It is widely recognized, for instcuice, that experts often have difficulty communicating 
pertment knowledge, especially 1) knowledge associated with complex tasks, 2) the full range 
of knowledge used for decision-making in a field, and 3) implicit knowledge - knowledge used 
repeatedly in a procedure until the procedure is performed automatically and the expert has 
lost the abihty to remember and verbalize the underlying process (such as much of the 
knowledge used when driving a car). 

Structi^ed interviews and protocol analysis are two of the most common methods used by 
knowledge engineers to acquire knowledge from domain experts. The typical protocol 
^T^'L'^' for domain experts to "think aloud" while making domain decisions Using 
methods borrowed from cognitive psychology, a knowledge engineer learns what experts know 
and how ihey reason with their biowledge." 

Knowledge engineers translate the knowledge acquired from experts into a form usable by 
a computer. This traiisformation of human knowledge into computer-readable symbols is 
referred to as knowledge representation. Production rules are the most conventional form 
of symbolic knowledge representation. They consist of "if' and "then" parts; "if lists a set of 
conditions presented in some logical combination, while "then" represents the conclusion of 
XZa .^'^P^^.^y^te^f that rely on production rules as the basis of representation are 
reterred to as rule-based systems (see Figure 2). 



" ^r^ctX"!'' ^"'""^ ' "'"'^ " ""-'"^^^ -P^-^-- fron, the 



perspective of the domain expert. 
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Examples of Rules 



From NAGARA's strategic planning for preservation application: 

IF: (15.1.8) Do you have a procedure for identifying and selecting 
archives records that require a security copy and/or use copy? NO 

THEN: GOAL I, Reprography Step One, OBJECTIVE D (a) - Develop 
appraisal strategies that relate record reproduction options to use/value 
and original record media. (2.1.1.7.1) - Probability=9/10 



From NARA 's Archivist's Assistant: 
ifgeo = GEO 

and geo_specific (GE0,1) = GEOSPECIFIC 
and do ("reset series (GEOSPECIFIC, RG) 
then geo_search (RG, TOPIC) 



A paraphrase of a rule from NARA's Archivist's Assistant: 

IF the information-sought is suits_against_the_Dept._pf_Int rior 
THEN examine record group 46 cf 70 



Figure 2 

Another common form of knowledge representation, frames, packages knowledge in a way 
that accommodates cognitive associations. Frame-based systems are noted for their efficient 
representation of detailed object relationships, such as the association that a living room 
typically contains a couch and chairs, but not an ocean. New methods of knowledge 
representation with alternate advantages, such as case-based reasoning and reasoning b" 
analogy are beginning to migrate from AI laboratories to the knowledge engineer- 
workbench. 

In addition to acquiring and representing knowledge, the knowledge engineer designs the 
reasoning structure that controls the search and execution of rules (or optimizes the inference 
engine used in a software ahell). To complete the process, the knowledge engineer 
determines how to integrate the use of uncertain knowledge into the reasoning process and 
the types of explanations about a system's recommendations that would benefit the end user. 

One of the key principles of knowledge engineering is iterative development practiced in a 
manner that incorporates extensive involvement of the expert(s) into all stages of building 
the program. Knowledge engineers have adopted an iterative approach to expert system 
design to overcome the difficulties associated with the cornerstone development activity: 
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knowledge acqmsition. ^^owledge acquisition is fundamentally a process of communicating 
the mental reasoning and thought processes of an expert to the knowledge engineer. But the 
problems associated with both the recollection and communication of complex and impHcit 
knowledge are extensive. 

After considerable research, knowledge engineers discovered that iterative prototype 
development can be used to substantially reduce the difficulties associated with knowledge 
acquisition. Iterative prototyping involves the successive repetition of knowledge acquisition 
and knowledge representation until a system reaches the desired goal. Systems unfold in 
stages, with each stage an improvement on the proficiency of the previous stage. This 
iterative method of development expedites knowledge acquisition because gaps in information 
and lapses in logic imperceptible to humans ultimately are discovered while building a 
workable application. For example, a domain expert may communicate to a knowledge 
engineer five steps for solving a problem. Because the expert's reasoning is so automatic, 
he/she inadvertently fails to express an important consideration. The domain expert may 
assure the knowledge engineer in good faith, however, that the procedures conveyed are 
complete. It is not until the knowledge engineer's program is run and found to fail that 
either member of the team ib aware of the omission. Further, iterative prototyping points 
out the knowledge engineer's misunderstandings, misinterpretations, or misrepresentations 
of the expert's meanings. In short, knowledge engineers have discovered that in most 
instances the best technique for recognizing their own mistakes or for prompting experts' 
recall is to show experts their methods in operation. Consequently, expert systems are built 
through an iterative process of development. 

It is difficult to suggest approximate costs for expert systems, as each application entails 
domain-specific problems that affect development times, hardware and software selection, and 
requisite staff resources. Although an inadequate gauge, expert systems usually are 
measured through the number of rules or rule-equivalents.^^ Small systems average from 
50-500 rules and are typically developed on a microcomputer with a software shell. Mid- 
range systems generally contain 500-1500 rules and are developed on supermicrocomputers 
or minicomputers using a shell or a high-level programming language. Large systems exceed 
1500 rules and are programmed for a minicomputer or mainframe environment with a high- 
level or AI language. Software tools appropriate for small applications average from about 
$100-$1,000, The mid-range application tools cost approximately $2,500-$25,000, with most 
selling between $5,000-$8,000. Mainframe software varies in price, but it averages about 
$60,000 - $100,000. As with conventional software, the price/capability ratio of expert 
systems software tools is decreasing. 



The "size" of an expert Bystem is difficult to determine. Ted Senator, U.S. Navy, in a presentation to the 
IEEE Technical Committee on Expert Systems (WDC, June 6, 1990) suggests that the number of rules is 
an inadequate measure, because 'large" systems simply could be Indicative of sloppy knowledge engineering 
and responsible for a system's IneflRciency. Further, as new forms of knowledge representation become more 
common, fewer expert systems contain rules per se. Senator proposed that the size of an expert system be 
evaluated by assessing the problem scope (the number of domains and sources of knowledge), the size of 
the knowledge base (including number of rules or rule equivalents, search space, and lines of code), the code 
n quired to embed or integrate the system into other operations, the performance expectations, the number 
uf u^ers and sites, and the ultimate deployment of the system. 
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IV. Federal Government Use of Expert Systems 

Even though more than 80 percent of Fortune 500 companies are integrating expert systems 
into their operations, the government is stiU considered the greatest user of the technology 
Indeed, the Commerce Department reports that approximately 75 percent of the U.S expert 
systems market belongs to government." In particdar, mihtary, intelHgence, and the space 
program havesponsored artificial intelhgence research and development in this country for 
decades. An Office of Technology Assessment projection estimated that in 1988 expenditures 
lor defense-related artificial intelhgence research would reach $155.5 million, while the 
government wouxd devote only a firaction of that figure, $17 milHon, to civilian AI research " 
For many years the Defense Advanced Research Projects Agency (DARPA), the chief source 
01 military funding for AI research, has funded research projects to develop miUtary systems 
that assist m strategic planning and in determining a course of mihtary action. 

Today, the technological breakthroughs achieved with defense-oriented systems are finding 
application m avilian systems. There are no accurate figures, however, on how many civiUan 
government expert systems applications exist. Neither the Office of Management and Budget 
nor the Congressional Budget Office tracks this type of expenditure acn.^s agenc^s. 
Moreover, it is difficult to establish the pattern of use within any one agency because the 
technology tends to be administered Uke any other mature software tool, and therefore, does 
not appear rs a septate budget item. For these reasons, knowledgeable sources within the 
federal government hesitate to estimate formally the extent of civilian use; however, ofi-the 
record hey speculate that the development of prototypes and operational expert systems is 
currently widespread and growing," and in the next 5-10 years, expect that "inevitably" they 
will be used throughout government. J J' 

The purpose of this section of the paper is to illustrate, through case examples, the nature 
and extent of use of expert systems technology witJiin federal civilian agencies. This section 
provides archivists with a glimpse of the widespread use of the technology for a broad range 
of tasks. The status of the technology is examined in three agencies : the Internal Revenue 
bervice the Social Security Administration, and >he Office of Management and Budget 
(Executive Office of the President). Applications underway in a half-dozen other agencies are 
also briefly mentioned. The sketches that follow introduce readers to a small portion of 
lederal civihan apphcations already operational or under development. 

Internal Revenue Service: Virtually every citizen, as a taxpayer, is subject to oversight by 

l«wJ«nH'^1i J'""?! ^"^'^ responsible for enforcing internal revenue 

laws and collecting the proper amount of taxes. It must perform these activities at the least 
cost to the public and in a manner that fosteis the highest degree of confidence in the 
agency's integrity, efficiency, and fairness. During the past five years the IRS began 
investing in expert systems technology in order to improve its performance of these mission 



Department of Commerce; "Artificial Intelligence," U.S. Industrial Outlook 1989, pp. 26/3-26/5. 

it?nTIT\T7^^ Technology Assessment, Federal Funding for Artificial Intelligence Research and 
Development. (For full citation, see section 5 of the bibliography). 
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tasks. The IRS currently has eighteen expert systems applications in operation or under 
development. 

One of the agency's m^or applications, the Auditor's Assistant, helps staff classify for audit 
the citizen and corporate tax returns most apt to result in additional assessments for the TBS. 
Currently in its fourth year of a five-year project, a real-time pilot implementation of the 
Auditor's Assistant was field-tested at one service center during the 1990 tax season. The 
task of classifying tax returns for audit typically is performed as a two-step process. First, 
tax returns undergo a computerized aJgorithmic statistical screening. Auditors then 
manually examine the "sensitive" tax returns to reduce the list further and to identify the 
key audit issues in the returns that remain. Since audits are extremely time consuming and 
expensive, auditors attempt, through the manual screening, to identify tax returns with the 
greatest potential for additional assessments, and that '^''^^tain the most solid issues to focus 
on. Experienced auditors who classify the returns use i-.ieir own set of heuristic criteria to 
distinguish efficiently between the best and poorest choices. 

To develop the Auditor's Assistant, a group of expert auditors fi:om across the country came 
together every couple of months for several years to participate in knowledge acquisition 
sessions. In these rigorous sessions, the auditors were not only able to achieve agreement 
on all primary issues related to tax return classification, but also developed a set of rules that 
each believed was better than any one individual heuristic. In fact, at the conclusion of the 
sessions, the auditors insisted that the opportunity to scrutinize their decision-making 
process with other expert auditors improved their own individual performance. The IRS 
calculates that use of the Auditor's Assistant will result in an annual $50 million return on 
investment. 

The purpose of another system underway, the Taxpayer's ServictJ Expert Assistant System 
(TSEAS) is not so much to bring in more revenue as to help the IRS solve a disturbing public 
relations problem. TSEAS provides guidance to the temporary employees hired by the IRS 
each tax season to respond to taxpayer phone inquiries. In the past few years, research 
conducted by the IRS revealed that up to one-third of staff responses to taxpayer phone 
inquiries were incorrect. Although troubling, the extent of error is not surprising since the 
information necessary to perform the task is physically dispersed in 159 IRS publications and 
10 voltunes of tax regulations. After attempts to solve the problem using conventional 
programming failed, the IRS decided to build an expert system. The design goal is to create 
an application sufficiently robust to handle about 85 percent of the questions posed by 
taxpayers. 

A pilot implementation that supports accurate, quick, and consistent IRS response to 
taxpayer phone-in questions was field-tested in Boston during the 1990 tax season. As a 
result of the expert system, the Boston office's accuracy rate for responses to taxpayers 
inquiries improved 20 percentage points (from 60 percent to 80 percent) and the overall 



The IRS investment in expert systems technology began with the selection of a cadre of mid-career staff 
with strong backgrounds in tax administration uo receive training in artificial intelligence. The first group 
of nine staff were enrolled for two-year terms in AI programs at the University of Pennsylvania, MIT, 
Columbia, and Stanford. On their return in 1986, the IRS established its AI Laboratory. Today, the 
Laboratory has expanded to include about two dozen staff. 
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accuracy rank of the office rose from twenty-third to tenth place (based on 27 national call 
sites). These figures are expected to improve even more as staff become accustomed to using 
the system on a full-time basis. No hard data is available yet on how well the system 
enhanced productivity, but it appears that the application represents a more efficient way to 
deliver services. IRS plans to expand the field-testing of the system to four additional sites 
next tax season. 

Some of the other IRS applications include: 1) a system, operational since 1988, that 
processes virtually all of their requests for permission to waive the IRS regulation that 
requires companies to file a tax return on magnetic media; 2) another system implemented 
in 1988 that increases the quantity and improves the quality of examiners' letters to 
taxpayers; 3) a system that determines Hie actuarial soundness of employee pension plans 
(the IRS reports that in 1989 this system resulted in a $100 million gain in assessments); 
4) a system, scheduled for pilot implementation this year, that helps determine the tax 
relationship of a subsidiary corporation to a multinational parent company; and, two 
applications still under development, one that determines whether an individual can be 
considered an employee for tax purposes, and the other that makes recommendations on the 
abatement of taxpayer penalties. The?, and comparable applications demonstrate the IRS' 
strong confidence in the ability of expert systems technology to perform as a powerful 
strategic tool in solving modem tax administration problems. 

Social Security Administration: The Social Security Administration (SSA) administers the 
nation's retirement, survivors, and disabiHty insiu^ce programs, as well as its supplemental 
income program. Approximately 62,000 SSA employees operate out of more than 1,300 
district offices and service centers. Claims representatives, SSA's key employees, make 
eligibiUty and entitlement determinations in compliance with voluminous regulations that 
number more than 40,000 pages. These regulations are subject to monthly updates. The 
claims representatives' training consists of 13-15 weeks of classroom work follo^ved by one 
year under close supervision in the field. The ability to recall and apply complex logic from 
the regulations is key to successful performance. Since SSA operates by authority of law, 
accurate and consistent determinations are vitally important. 

SSA created the Expert Systems and Future Technologies Branch (ESFTB) about five years 
ago to introduce new technologies into agency operations. The ESFTB is actively developing 
expert decision-support systems for claims representatives. The ultimate goal is to replace 
the massive paper manuals with an intelligent electronic library of personal-computer-based 
expert systems fiilly integrated with the current mainframe support. Progress towards this 
goal proceeds through incremental development of applications designed to help claims 
representatives identify the pertinent issTiss in a claim, provide a step-by-step procedure for 
aniving at an eligibility determination, and produce both a customized transaction of the 
criteria used in making the decision and personahzed letters to the claimant. 

Specific expert systems applications in o- awaiting field-testing include: 1) a system designed 
for use by claims representatives as a training tool and in client interviews for scheduling 
and executing overpayment waiver claims, 2) a system that produces customized living 
arrangement notices for supplemental insurance recipients; 3) a system that produces 
customized continuing review notices for disability beneficiaries; and 4) a system that 
evaluates gainful activity for disability claims. 
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SSA eJnects th;7;:;Xr°J_!ir-T.'* ""^ evaluated by disability examiners. 

Disability ProgrStZZ^^moZ!" * "'^ admimstration of the 
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systems, a counter-terrorism information system, already is operational in several * al 
offices. The FBI is developing these systems as a way to manage the complexity of nSd m 
investigative case management and to reduce the number of years typically needed to^Sain 
new agents. To accommodate the agents, the investigative system is entirely mouse^wn 
requires no typed input, and draws all information from data iiles. A comparable design will 
be used for all future investigative systems.^** ^ 

At the Environmental Protection Agency (EPA), staff introduced expert systems technolojrv 
by conducting an agency-wide survey to identify the most important problems that lent 
themselves to expert systems solutions. Since the survey, the EPA has fielded or begun 
developing approximately one dozen applications, primarily to assist in tox?c WMte 
management. One application interprets regulatory language governing storage tanks for 
toxic chemicals. Another uses the technology to calculate the cost and time required to clean 
up hazardous waste sites. The Expert Disclosure Analysis and Avoidance System (EDAAS) 
an early apphcation, assists public information speciaUsts to determine which information 
concerning the manufacture and distribution of toxic chemicals may be released to the public 
without compromising sensitive data that the EPA must by law keep from pubUc disclosure. 

The National Forest Service is producmg a number of advisory systems to help foresters 
make management decisions. For example, tiie Red Pine Advisor is used in eastern Michigan 
as a forest administration aid that recommends when to plant and when to harvest. Other 
systems provide guidance for gypsy moth control or for the use of insecticides with otiier 
pests. In future appUcations, the Forest Service hopes to combine expert systems with 
geographic information systems to improve fire management." 

The growing use of expert systems by the federal government demonstrates the confidence 
placed in the technology by agencies. Not unlike archives, in recent years the budgets of 
cr^ian federal agencies have decUned, while the demand for services has risen. These 
conditions have forced civilian agencies to increase productivity while reducing expenditures. 
Some federal agencies use the technology primarily to assist in mission tasks, while others 
concentrate on its benefit with routine administrative probleKis. The applications involve 
tasks such as scheduling, planning, selecting, classifying, training, and interpretation - 
activities performed by archivists in the routine course of their work. Civilian agencies, 
careful to avoid projects with unproven technologies, have found that modernizing task 
performance by using expert systems technology with suitable problems represents a key 
method for managing the "more with less" mandate of the nineties. 
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Information on FBI expert systems is from a presentation made by Gary Gardner (FBI) to the IEEE 
Technical Committee on Expert Systems Apphcations (Washington, DC, June 6, 1990) and from Harvey 
P. Newquist Bloodhounds and expert systems," AI Expert. (For the full citation, see section 5 of the 
bibliography). 

Norville (1990), p. 22. (For the full citation, see section 5 of the bibliography). 
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V. The Library Profession'^ Use of Expert Systems 



Many archives are exchanging information on their holdings in online datahao . 
the use of conventional bibliographic standards. The Hbrary profession is !!!f, • '^'^ 
systems technology chiefly to assist in tasks related to the development of 2"°^ I'^^'J 
databases, in areas such as cataloging and information retrieval. While a smii T p 
prototypes are being developed for other types of mission functions (e.g., acquS^J^'^^^f 
library loan),- surprisingly few pn,jects take advantage of the tecbS^^°,^r In h^^^^ 
administrative tasks, such as planning and scheduhng activities, financial andv,?o Z ^ 
management consultation. Even though operational systems exist in only a h^n^A w 
hbranea, many graduate schools and research libraries are developing prototynea ^ . 
the suitabihty of the technology to Hbrary domains. There are no aggregate esSt^ IT. 
use of expert systems by the library profession, but their keen inteiest in theScWw^' 
llrratui?^ ^""^ pubHcations that appear annu^t the 

Further a recent three-day conference sponsored by the Graduate School of Ubrarv and 
Infomation Science at the University of Illinois, "Artificial Intelligence and Expert SjSema 
Will They Change the Ubraryr , drew more than 100 mid- and upper-level hb™Sage^ 
from around the country - Similarly, the Special Libraries Association is devothig TtS 
day institute to IntelHgent Systems: A Framework for the Future."" Moreover, the CoS 
on Library Resources, a private foundation whose grants are often in the forefix)nt of Ubrarv 
developments, reporte that last year it accepted proposals to grant funds to four exnert 
systems projects while rejecting a number of others.^^ Bibliographic utility vendors aLo 
exhibit substmitial interest in the technology. OCLC recently subsidized a research project 
to assess the feasibility of using expert systems to support automatic title-page cataloging 
Ihe prototype achieved an 80 percent accuracy rate in performing automatic cataloging fix)m 
a monograph s htie page. Before attempting to build an operational system, the investigators 
involved in the project recommend further research in four areas, to: 1) achieve a deeper 
understanding of the task, 2) overcome grammatical irregularities of title page layout, 
3) temper scanning and graphical interpretation problems, and 4) minimize problems 
assoaated with the complexity of the cataloging rules.^^ 
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ConverBation vnth ProfesBor Linda C. Smith. Graduate School of Library and Information Science, 
University of IllinoiB (April 3, 1990). 
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Conversation with the conference planner, Professor Linda C. Smith: 27th Annual Clinic on ubrarv 
Apphcations of Data Processing (March 25-27, 1990). sponsored by the University of Illinois at Urbana- 
Champaign, Graduate School of Library and Information Science (April 3, 1990). 

Held October 22-24, 1990 in Washington, D.C. 

Conversation with Eleanor Sacks of the Council on Library Resources (April 4, 1990). 

Stuart Weibel, et al, "Automated Title Page Cataloging." (For the full citation, see section 6 of the 
bibliography). 
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The three nationa hbraries - the National Library of Medicine, the National Amcnltural 
labrary and the libraiy of Congress - each approach the technology in T^ire!^^^e1 
^e Lister Hill Center for Biomedical Communications of the National Li^ra^ of 

Medln^x, one of NLJTs chief research applications, and conceivably the library pr^sS 
most advanced expert systems prototype, performs as an indexer's assistaTin Sng 

LfJfrX *T ^ '^^^ P">^*yP^ ^*^°^Pt« to improve the quX of 

medical hterature mdex^g by automating to the greatest possible extent index tem seM^^^^ 
and application of mdex rules. Progress on the MedlndEx prototype continues ^t? lfthe 
Inn^^'^^^f L^.^"" comprehensive knowledge base, 2) augmentation of thHser heb 
faahty, and 3) additional testing and evaluation of the system.^ ^ 

Apart from the library's research achievements with AI, several years ago NLM's technical 
^rvices division decided to explore the benefits of using expert systems forlrlpera^^^^^^^^ 
Since summer 1989. the division has devoted a full-tiL staff ^sition t^ exp^^^^^^^ 
development The goal of the first application is to create a catSoging assistL 4" 
mitial projec concentrating on personal name authority control. If sucTessful this 
application ultimately will work in conjunction with the division's automated cataloring 
system to provide expert gmdance on determining the authorized form of names. NLM ^taff 

u •! ^"^"^P^ ^^^'^'^ ^^'"^ ^""^^^ «yst«°^s development because 

MnL^ '''^^^^^ entails a long learning-curve to produce experts. 

Moreover, the staff recogmzed that knowledge engineering might identify areas where 
cataloging rules are vague, contradictory, or in need of clarification. In subsequent 
applications, the staff plans to focus on other aspects of authority work and descriptive 
cataloging, as well as subject analysis and classification.^^ 

l^ft.^lf^''^*^*^''^*'^^^ ^}'^ ^^^^ operational use of the technology 

systems m use and several others under development. REGIS, an application that 
includes an embedded expert system to respond to patrons' queries on African aquaculture, 

tX^^M r ^ r I'''' ^11?^^ °" ^ "°PPy distributed by the National 
Technical lnfomation Service (NTIS). A second application provides reference assistance on 
nutrition directly to patrons. A third serves as an interface that helps users identify and 
execute a search from a personal computer for information stored in DIALOG (a commercial 
online bibliographic database service: NAL plans to use expert systems technology for a 
vanety of purposes, including as a front-end navigation tool for multi-media CD-ROMs being 
compiled on plant information. 

Another application underway, CATTUTOR, serves as a tutorial for training inexperienced 
catalogers. Tlie hypercard application links Machine-Readable Cataloging (MARC) format 
guidehnes and Anglo-American Cataloging Rules, 2nd edition (AAC^2) with a menu-driven 



Susanne Humphrey Lister Hill National Center for Biomedical Communications (NLM), is the principal 
developer of MedlndEx. p p 
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One other application, MEDSTATS. currently being field tested in the NLM reference division, directs users 
to statistical sources in the health sciences. 
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tutonal on descriptive cataloging that instructs and evaluates the performance of trainees 
The purpose of the prototype is to shorten the length of time required to develop expert 
catalogers. Completion of the prototype is scheduled for June 1990. Once completed the 
library community will be invited to evaluate the effectiveness of the prototype and to make 
recommendations for further development. 

A final application, the Pesticide Applicator Training (PAT), provides a computer-assisted 
learning program for individuals who wish to become certified pesticide applicators This 
application is bemg developed by NAL with funding fi^om the Environmental Protection 
Agency in coiyunction with the United States Department of Agriculture (USDA) Extension 
^rvice Every state requires persons who apply pesticides in a commercial setting to attend 
formal trwning and/or to pass an exam. PAT wiU be made available through the training 
services offered by the agricultural extension service to aU these individuals. A recentlv 
completed prototype will be demonstrated for the collaborating agencies in 1990. 

of Congress (LC) estabHshed a committee to learn about expert systems 
technology and to perform a systematic survey within the technical services branch to 
Identify can(hdate apphcations. The purpose of the survey was to determine the problem 
oommns most suitable to expert systems technology and to estimate accruable benefits likely 
to result firom the development of operational systems. Technical services tasks were 
em u Tted against a comprehensive Ust of characteristics identified as "essential" and "highly 

r^^^ I^^u""^^ ^ f^^"^ tl^is assessment, the committee 

^rTv f application areas from a dozen Ubrary domains as the most pn^mising 

^n^S ^ ^l^elflisting assistant, 2) a series consultant, and I) a subject-cataloging 

Slw^r^ ^'"^"-'""^'- ^^u'^ ^^^ri^^^ are stored fmd retrieved through shelflist numbers. 
DmS£^'ilI!^•?"^^^ f "^'"^ representation for an item is largely an algorithmic 
fo^n ^ . f uaticns, however, complicate the process: 1) the need to develop a nmnber 
nlTi^ 1 ^ ^'^^^ of authors within a particular classification, and 2) the 
such ^ ' ^r^' ^^'^ ^ taken into account! 

ZtZ^.^r'^^''^ T^'"^. ^ ^ main-entry heading. In thes^ 

^Srs'^Tre^ate^ ^^^^f'^ ^^^^ ^^'^^^ ^ classification 

Shplfl^intrA !^ documents to manuaUy determine the shelOist slot. The proposed 

SSnthpd^pV ^^^^"^ «y i^^^Pr^ts and'ap'Sies 

SrinT^^^^^^ related documentation, and analyzes the patterns and practices 

impiiat in the shelflist itself to create an appropriate call number. 

rftLrinHvl^''". f"^^ ^^"T^- P"^^^^^^ the most problematic aspect 

n«,^3 Ki ^T""^- ^ descriptive entry fo^ a monograph series presJnS 

c^^^«^nH T ^"^""'"^ ,V monograph series often are Wterized by frequent title 
tT!l^^ T pecuharities, 2) the pertinent nales and procedures are numen>us 
^cPssT^ f^' P'^.?''"' '^^^"^ significantly over the years which makes it 

necessary to relate new items to existing series established with outdated descriptive 
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hmoJ^T'' ^P^--^^ns, pp. 23.25. (For the full citation, see section 6 of the 
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of tasks: 1) estabUshing new series^^^XS^'**'' "? *^ ''""'"^S b«>ad cateS; 
«» appropriate cataloging rules <^S^^tS;,!f"^: "«atments K 

and form of subdivisions on/fl '^^e'^^ces, the precise form of subiect henrfinc« 7""°°** 
sMlledtask^S^tS 

The proposed Subject Cataloging 0>SS T'^ ^\My of LC documentation 

LC. supporting subject catafot^ig ^'ttl^ ^ 'l'* <i'^»^^^ 

Mcies, classification policies^teremt.^^ Z'^**" ''"^''^ subject hea*n» 
electjronic bibliographic^nre ^dTrbt^^^aMh^VT'*'"'''' "^"^ inCacTwSh 
databases. The proposed systeTwoSld match % subZslZ 
thesaurus, display broader and narower te-T. I'""''? ''^ "^taloger againsH 

^~«':»'»s. and search for reco^end 
classification number for each item tEh „^ also would devetoa 

a thesaun^-assisted search of tli dlS^rs^i?,?"'^^ '"''j^^' ^''"^^^ "r Z^^ 

the Subject Cataloging Consultot wouU^t^ ■ ' P"'P<»«'» anticipate that 
insistency of practice, and the ,»tentiLTd K^H *• ? iff'''*'" P-^^'ivity, imp^^ 
three proposals described here apwTtol^ n • .*''"''"''on of scarce expertise LC's 
aecisions await a detailed cost-benefit analysis. '^"^^^^^^ plications, but implementation 

hJ^^^edif L^Xt^^^^^ ^'?-'<^es. in combination with 

conventional library. Ve compuSed l^'*"" "^*^' '"'^^ « "thinking ofT 

tfhnoloa-, represents the first step Sth^»r^°^^ next-generation 
?f computer networks that provide of waU-less institutions ^mS 

tTa v«l; r^^P^'^^y-'^^^^^tjounialarticC?^^^^ ^"^ages, economic 

- to satisfy patron --^^-SZ^ia^^^^-J^^^^^^^ 
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VI. state of the Technology 



The proficiency achieved in current expert systems technology is the result of nearly thirty 
years of research. Within the past five years, several key advances mark its maturity into 
a reliable software tool. First, it is uo longer necessary to build an expert system from 
scratch. A seasoned AI industry has transformed the lessons learned about solving logical 
problems into structured software shells. In the near term, forecasters predict the 
appearance of task-oriented shells that will improve the performance of inference engines 
marketed for specific domains. Second, expert systems, regardless of how they are 
programmed, typically can now run on many different computers, can be embedded into 
standard appKcations, and can process information stored in database management systems. 
The ability to integrate expert systems technology with existing operations has ehminated 
one of the key obstacles to widespread adoption.'^ And third, the development of alternate 
methods of knowledge representation, such as case-based and analogical reasoning, opens the 
technology to a wider range of domains and more compKcated problems. 

But probably the most profound breakthrough has just occurred with the recent release by 
IBM of its new expert systems product, called The T^iitegrated Reasoning Shell (TIRS). 
According to IBM, TIRS provides new features in addition to functionahties comparable to 
existing tools, at a fi-action of the standard price. Even if TIRS encounters strong market 
competition, the IBM seal on expert systems technology can be expected to go a long way 
towards establishing both its legitimacy and standards in the industry. 

The advances achieved with expert systems technology, however, tend to be tempered by a 
fundamental weakness: brittleness. Expert systems only know what they know. They are 
absolutely useless when asked to solve a problem or reason about a task that falls outside 
their knowledge base. For instance, a medical diagnosis application may recognize a 
description of a car covered with rust spots as measles, because 'ts knowledge base does not 
include the common-sense observation that inanimate objects do not contract infectious 
diseases. As a result, suitabiUty of the domain to the current technology must be carefully 
assessed. The most suitable domains rely on narrow, highly structured factual knowledge. 
For this reason, expert systems are designed to assist, rather than replace staff, freeing 
experts to concentrate on the more complicated problems within their domain.^' 

Given this limitation, what explains the widospread use of expert systems technology by 
business, industry, and the government? To answer this question, several years ago Dr. 
Edward Feigenbaum and his associates at Stanford University, performed a trail-blazing 
study that examined the effectiveness of 1500 operational business appUcations. The key 
finding of their research is the dramatic increase in productivity that results fi-om use of the 



See for instance, David Freedman, "The big payoff in expert systems"; and "Expert system chip may ease 
integration," Computerworld, for advances in integrating expert systems technology with conventional 
systems (see section 3 of the bibliography for full citations); see also Kamran Parsaye, et al., Intelligent 
Databases: Object-Oriented, Deductive Hypermedia Technologies, N.Y.: Wiley (cl989). 

From keynote address made by Edward Feigenbaum to the Seventh Annual Intelligence Community 
AI/Advanced C!omputing Symposium (Reston, Virginia), October 4, 1989. 
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technology. Productivity was measured in two ways: 1) increases that result Irom speed-up 
of task performance and 2) increases that result from transfer of expertise. 

Feigenbaum found that assistant-type applications commonly performed professional and 
semi-professional business tasks at rates twenty, thirty, and even forty times faster than 
previous operations. Any change that shows an order of magnitude increase in speed of 
operation (speeds at a factor of 10 greater than the original) is considered revolutionary. In 
the Feigenbaum study, corporate and industrial managers reported increases in speed from 
16 to 400 times greater in tasks using expert systems appHcations - representing from 1 to 
2 orders of magnitude of change. The impact of just one order of magnitude increase in speed 
is comparable to the difference between driving a car instead of walking, or taking a jet in 
place of a car to reach a destination. 

Further, expert systems technology increased pi-oductivity by placing the expertise of experts 
into the hands of those less skilled performing the same task. Corporate managers contend 
that for many tasks the top one percent of an organization's staff substantially outperforms 
the rest of the work force, often exceeding productivity of the bottom half by a factor of 
twenty.^ It is expertise - the combination of knowledge, experience, and personal heuristics 
- that enables the top one percent to arrive at optimum solutions in the least amount of time. 
Expert systems, built on acquired expertise, transfer the expertise of the top one percent 
performers to the least productive staff members. Feigenbaum's study found an order of 
magnitude increase in organizational productivity as a result of expertise transfer, and 
established that overall operational expert systems generate a fifty-fold increase in 
productivity for domain tasks. 

The striking increases in productivity were not the only benefits reported. Managers assume 
that the greater consistency imposed on d'Kjision-makmg by expert systems reduces legal 
vulnerability. Insurance companies, corporate sales departments, and even the IRS and SSA 
claim that inconsistent treatment of customers (or citizens) encourages legal action. Expert 
systems also promote organizational stability by capturing and distributing a corporate 
memory otherwise lost through resignations, retirements, or transfers of critical staff. 
Further, expert systems expose inconsistent, redundant, or illogical procedures. They are 
arguably the ultimate tool for learning about a domain and establishing a codified set of rules 
that represent the logic and heuristics involved in task performance. Further, they offer 
predictability. The performance of expert systems, unlike humans, does not suffer from 
bumt-out Fridays. 

Moreover, expert systems help organizations manage complexity. Contemporary work 
situations often require people to master impractical amounts of knowledge. Expert systems, 
however, can perform favorably with very large volumes of otherwise impenetrable 
information. For instance, Boeing Corporation decided to develop an expert system to assist 
mechanics in aircraft repair. Boeing selected this domain because the repair of any one 
aircraft malfunction typically requires the mechanic's intimate familiarity with at least 
several feet of manuals. If the amount of information found in several feet of manuals is 



See for instance, Augustine's Laws (pp. 44-45), by Norman Augustine, President and Chief Operating Officer 
of the Martin Marietta Corporation. (For the full citation, see section 7 of the bibliography). 
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multipUed by hundreds of possible malfunctions it is apparent why aircraft mechanics need 
automated intelligent assistance.'^ 

Nonetheless, expert systems technology is still an evolving enterprise. During the next few 
years, forecasters predict that an industry standard for software shells will emerge, espeaally 
given the appearance of an IBM tool. This will make it easier for technology consumers to 
evaluate the tools on the market.'^ Forecasters also anticipate the development of well- 
defined procedures for managing knowledge engineering projects. Seasoned developers ol 
operational systems appear ready to estabUsh guidelines for iterative development and 
stsmdard time fi-ames for production. To advance this work the IEEE Technical Committee 
on Experi; Systems AppUcations sponsored a conference on "Managing Expert bystems 
Programs and Projects" in the fourth quarter of 1990. 

Stm needed, however, are verification and validation (v&v) procedures - solid methods for 
confirming the extent to which a system meets a specification, is internally conect, and solves 
a real-worid problem. The lack of adequate v&v procedures remains a key obstacle limiting 
the industry'^ abiUty to create very large knowledge bases. However, interest by the defense 
industry and the National Aeronautics and Space Administration (NASA) in autonomous 
systems - remote systems capable not only of performing a task, but also of monitoring and 
repairing themselves as necessary - has pushed this issue to the firont of the near-term 
research agenda. 

Long-term research addressing the more fundamental problem of application brittleness 
focuses on two areas: 1) the development of very large "basic" knowledge bases, and 
2) machine learning. The industry's key approach to overcoming application fragility is to 
develop very large "basic" or common sense knowledge bases. The most comprehensive 
project is a collaboration among leading industir research centers who work together under 
the rubric of the Microelectronics and Computer Technology Corporation (MCC). MCC has 
devoted approximately twenty work-years to the development of Cyc, an incremental project 
to encode the basic rules of physics, cultural belief systems, facts of history, and common 
activities of humans into a system that will serve as a platform to support specific domam 
applications. While still in its infancy, Cyc has been used with an application that helps 
salespersons handling the sale of large computer systems determme customer requirements. 
Since Cyc contains basic knowledge such as: "rooms in physical structures must be entered 
and exited through a door," it can assist, for instance, in calculating the amount of cable 



Remarks by Richard Anderson, Boeing Corporation/Kent Aerospace Center, to representatives from the AI 
Steering Committee (Seattle; January 25, 1990). 

There is quite a bit of activity directed at developing standards for expert systems. The American Institute 
of Aeronautics and Astronautics (AIAA) is working on a standard terminology glossary, as well as 
benchmarks for evaluating software tools and tool performance; DARPA is supporting work on the 
development of knowledge representation standards; and the Institute of Electrical and Electronic 
Engineers (IEEE) Technical Committee on Expert Systems Applications (ESA) is considering expert systems 
development standards. 
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needed to establish a network, given particular information such as room dimensions.** In 
a related efifort, Edward Feigenbaum recently announced a national initiative, that includes 
developers from government, industry, and academia, to codify basic knowledge used in the 
fields of science and engineering.^" These platforms, such as Cyc and the science and 
engineering effort, are considered the chief method for correcting expert system brittleness. 

A second long-term research pursuit is machine learning - computer programs that improve 
with use as a result of actual or simulated experience. Researchers attempt to produce five 
types of leammg with computers: 1) rote learning, where a computer identifies the input of 
a problem it already has solved and avoids making the same search or calculation mistakes 
twice, 2) learning from examples, where rules are induced Gcom examples, and the rules fit)m 
new examples are automatically incorporated into a knowledge base, 3) advice giving/advice 
taking, where the computer figures out how to break large aggregates of knowledge into 
processable chunks to integrate them into an existing knowledge base, 4) learning by 
explanation, where the computer induces rules based on a previous example, case, or theory 
that solved a problem, and 5) discovery, where computers generate new knowledge from 
amounts of information too great for human processing, by identifying patterns or anomalies, 
or by suggestmg likely future scenarios based on a stated condition. Discovery constitutes 
the most sophisticated approach to machine learning. The Central InteUigence Agency (CIA) 
IS usmg th.3 approach in developing STRADS (Strategic Automated Discovery System), an 
application that can forecast or "discover" plausible international political, military, and 
economic events by analyzing an extensive intelligence knowledge base.^^ 

New forms of knowledge representation such as case-based reasoning and analogical 
reasoning expedite the programming of machine learning. For example, a manufacturing 
application developed by the Lockheed AI Center (Palo Alto, CA.) recommends a method of 
selection and arrangement for firing and cooling thousands of ceramic pieces of different 
shapes that minimizes breakage and optimizes production flow. To arrive at an optimrd 
layout the system considers the size and shape of the various pieces, the furnace loid 
capacity, the mtervals at which the pieces are needed for production assembly, and the time 
required for each load to cool. The expert; system first requests information on the materials 
awaitmg firmg then uses case-based reasonmg to select fix)m its memory a successfiil layout 
that most closely approximates the current reported conditions. The system recommends four 
diflerent approaches and provides a rationale for each one. The user selects an approach, 
nms the heatmg/cooling process, proceeds with production, and then rates the degree to 
which the approach succeeded or required modification. It is at this point that machine 
learning occurs, as the system automatically incorporates feedback fix)m the user on the 



For more information on Cyc, see: Douglas B. Lenat and R.V. Guha. Building Large KnowUdge-Baaed 
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^'sfo Development, CIA) presented information on STRADS at the 

AISIG 'go Conference (Washington, DC, Panel 8, May 11, 1990). INLEN, another discovery tool presented 
at this conference, is being developed by Kemieth Kaufman of Geoi^e Mason University P^«»«"^ 
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implemented recommendation into its knowledge base. The recommended action with the 
user's feedback becomes a new case that the system draws on in future dialogue. In this 
way, the expert system grows more knowledgeable with use. 

Modern expert systems technology is sufficiently robust to support a wide range of 
operational systems, providing five key criteria are met: 1) the appHcation involves a suitable 
domain, 2) articulate and willing experts are available, 3) developers possess suitable training 
and experience, 4) end-user involvement is built into development, and 5) executive 
management supports the enterprise. The absence of any of these criteria will likely shelve 
an otherwise feasible project. 

Suitable domains are characterized by specific task attributes and operational environments. 
There are four task attributes that should be considered. First, tasks should be sufficiently 
labor-intensive to assiu-e that an expert system provides a reasonable return on investment. 
The number of staff and the time it takes to perform a job should warrant automating. 
Second, the task should require expertise and the need to replicate and distribute it. It 
probably is not worth the effort *o create a system that automates the ordering process at a 
fast food restaurant. Third, the task should involve judgments that rely on a clearly 
demarcated set of knowledge and principles. Decisions that hinge primarily on common 
sense are currently unsuitable for the technology. Fourth, the domain should tolerate 
"satisficing" solutions - degrees of acceptable answers, rather than only one right answer. 

Characteristics of the operational environment should be assessed in conjunction with task 
attributes. The proposed setting for an expert system application should have at least some 
of these elements: a shortage of skilled staff to perform a task; an expert's neglect of pressing 
work due to time spent on more mundane problems; a desire to improve the productivity of 
staff; a desire to insure consistency of approach and continuity of control between shift;s; a 
need for self-paced educational instruction; a need to preserve or distribute organizational 
expertise; a need to handle greater levels of complexity in performing tasks; a need to create 
expertise in an area where not enough knowledge exists; a need to provide expertise in boring 
or hazardous jobs that do not attract or retain experts; or a need to provide more consistent, 
timely, reliable, and high-quality decisions. 

Once a domain is deemed suitable, the availability of experts should be considered. Most 
knowledge engineers ask managers to release the staff membei<s) they can least afford to do 
without - the person that everyone performing a task goes to with questions. Moreover, 
m^or applications typically require the use of multiple experts. In any expert systems 
development project, the experts must be given adequate release time to work with the 
knowledge engineer. Beyond time, it is helpful if domain experts are interested in cognitive 
processes and the communication of ideas. Since the transfer of the expert's mental model - 
symbolic networlts and patterns of relationships (for instance, metaphors and analogies) they 
use to understand a problem - to the knowledge engineer is fraught with difficulties, it helps 
to have a person who likes to figure out the "how" and "why" behind their actions. The 
expert(s) also should be interested in the project and familiar with its goals fi-om the ou set. 

Expert systems also require --alified developers. What constitutes a "qualified" developer 
differs with applications. In .^^neral, "personal" stand-alone applications, those that primarily 
help one person perform one part of a task, have different development requirements than 
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the marketmg techcs used by many sheU vendors, naive users have a tendenw to <LXd! 



^^^X^^T^ *" P^"*"™ *«^«vely as knowledge engineers only in Umited 
^umsiances when the following six conditions are met: 1) the proposed aoDlication io a 

Kte L a Znnl^K^ • ^"f V ^ ""TO'^ation/agency. 2) the expert is computer 
ex^ enthusiast, 3) the proposed appUcation will be used strictly by the 

Sn^lUem S^^LT'*';°^''??"*'"° '^^ expert's jurisdiction, 4) the method of sohring 
lea<^T^Tfi^^ chaxactenjKd by an exceptional degree of expert consensus, 6) the domai^ 

^te^^l tim'^^^^T ^PP'^'f '•'^t """"^^ th«" « fe* hundred X 
and w a less than optimum system is tolerable (users will accept considerable sv^tem 

mefficiency, idiosyncratic logic, and less than peak performance) 

W^h^ reprogram around shell Ur tations, ascertain tiie most effective form of 
^dtt^Tf "T"*^*:"" and inference for a p^blem, a«iuii^ and model d^mmn W^dg^ 

s^Ie" w^k iLs'uX .^»r Knowledge engineers perform WgW; 

»5«»ttJo that an application uses the technology and its tools to greatest 

advantage, serves the appropriate audience, and ex.«:utes logicS a highly effident Cner 

S!''" '^™'»P"'«'" °f a successful expert system application requires end-user 

rdrrofXLnhvT"™-''™' ^"PPort- End-user input s?ourdtfZ 

m^if.^^ M !■ ^PP""'*''""- especially the interface, from project inception The best 
projects .uild end-user investment into tiie application. No matter whaUhe reason ^ 
apphcation that is rqected by ita intended audience constitutes failure. Telr maXme^t 
are^ equaUy unportant source of support for a project. Their endoreement is viS to^"^ 
dtKrp":^e:tr ^-""^ «^ encountered by 



Vn. Implications for Archives 



^^^nlitZ^iJXT^'^ '^'^ ^"Pe of «"t«-°ation. But 

now, expert systems technology ofTers archival managers a software tool that can bo used to 
solve an entirely new set of oiga.ni^atioaal problems The widespreaZsTof thTtechnoW 
by government and industry sujcesls at least three impUcations for archivists 1) a stoto 

l^^^ "^'l?'* ^""""'"^ ^'^ ^'^'^ mLagement of aXi 2 aS 
method IS needed to appraise existing expert systems, and 3) research is need^ toSeCS 
an appropnate response to the scholarly use of new "intelligent" research metho^kgtes 

systems suggest^ that archives would enjoy similar benefits by operationalizing the 
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technology. The studies performed on commercial operational systems clearly demonstrate 
the technology's broad applicability with both suitable mission-related tasks and routine 
administrative problems. The increases in productivity alone warrant an evaluation of the 
suitability of the technology to archival domains. The speed of completing tasks, the 
distribution of expertise, the increased consistency in decision-making that deters lawsuits 
or citizen coLiplaints, the customized instructional aids, the retention of corporate memory, 
and the management of complex functions are all desirable outcomes. 

The prospects are excellent that expert systems technology can improve modem archives 
administration in a wide range of tasks performed in managing records centers to appraising 
records. To maximize the technology's benefits, archivists need a strategy for developing 
informed infrastructures, identifying candidate appUcations, and integrating the technology 
into existing operations. 

Second, the growing use of intelligent technologies by government and institutions raises 
questions about the appraisal of the structure and content of these systems. Archivists will 
face the same set of medium-related problems in appraising expert systems as associated 
with other forms of electronic data. In addition, however, they also will encounter unique 
problems in evaluating the long-term value of the content of these policy-laden systems. 
Conventional programs, unUke expert systems, do not regularly incorporate an agency's key 
policy and interpretive practice into program code. Expert systems knowledge bases by 
design, however, need a complete set of pohcy rules with interpretative logic to support task 
performance. The appearance of "policy-driven" electronic systems promises to pose 
additional problems for archivists as regulations and procediures that historically appeared 
in a manager's file begin to appear exclusively as code in an expert system. Indeed, panelists 
at the 1990 Artificial Intelligence Systems in Government Conference held in May articulated 
that the tendency for expert systems to serve as the chief receptacle for an agency's policy 
represented an "obvious trend.""" As such, archivists need to monitor the use of existing 
expert systems to determine at what point adljustments in archival practice may be 
appropriate and necessary. 

Finally, the growth in social scientists' use of quantitative research methods in the last 
seversd decades suggests that there is a discemable relationship between the emergence of 
new technologies and scholarly research trends. Social scientists proved quick to adopt 
quantitative methods of analy3is when conventional computing capabihties became available. 
Currentiy, researchers are using not only machine-readable numeric records, but have begun 
to encode textual source material.* into electronic form to facilitate new types of 
computational analysis. 

Textual encoding is especially prevalent among humanities scholars. For example, the Data 
Collection Initiative (DCI) is a far-reaching effort to acquire and make available for research 
purposes a massive electronic text corpus that will serve as a comprehensive research 
resource. Sponsored by the Association for Computational Linguistics, the DCI is the most 



Statements made by panelists at the AI Shells session, AISIG '90 (Washington, DC, May 11, 1990). 
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extensive international collaboration of its kind The ultimate goal of the project is to develop 
a global electronic library available for research use through online terminal access.^' 

Smaller, more focused humanities projects to encode text in machine-readable form are also 
underway, including: 1) Oxford University's computer database for pre-Restoration Enghsh 
drama, as well as one for texts used by honor students in classics and modem languages, 
2) Queens University's electronic library of Canadian Hterature, and 3) a project by scholars 
in Toronto and Otago, New Zealand to create a Tudor textbase of wriiten works fix)m the 
sixteenth century. Himianities scholars predict that the millions of words of text already 
available in machine-readable form represent only a minute fraction of textual source 
materials that will be encoded in the next ten to fifteen years.^^ 

Concerned about the speed with which researchers are creating textbases for computational 
research in the absence of agreed upon encoding standards, the Association for Computers 
and the Humanities, in conjunction with the Association for Computational Linguistics, and 
the Association for Literary and Linguistic Computing received funding from the National 
Endowment for the Humanities (NEH) and the European Economic Community to implement 
a proposal to establish guidelines for the encoding and interchange of machine-readable 
textual records. The first phase of funding will be devoted to the needs of literary, Unguistic, 
and text-oriented historical research.^" As with conventional archival records, scholars 
beheve that the reuse of textual databases by scholars other than the original e».:oder8 will 
soon represent - if it does not already - the predominant use. For this reason they have 
launched a campaign to establish a machine-readable text archive designed to serve a new 
generation of researchers/" NEH awarded Rutgers and Princeton universities a planning 
grant for this purpose. 

The reason textual encoding has significance for archivists is because these efforts suggest 
the emergence of new scholarly research methods. As massive historical, political, scientific, 
and Uterary text bases become available in machine-readable form, "inteUigent" text analsrsis 
represents the logical progression in research methods."*^ Tomorrow's researchers, computer 



ConversationB with Don Walker, Bellcore (May 14, 1990), and Mark Lieberman, AT&T Bell Laboratories 
(June 5, 1990). 

Proposal for Funding for an Initiative to Formulate Guidelines for the Encoding and Interchange of 
Machine-Readable Text, by The ABsociation for Computers and the Humanities, the ^"^"J'^JJ.h 
Computational Linguistics, and the Association for Literary and Linguistic Computing, prepared for NbM 
(1988), p. 12. 
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In fact, artificial intelligence is already being used for the purpose of historical analysis. 
in his book Arti/iciaZ Intelligence: Applications to Logical Reasoning and Historical Research (p. ^^^^^^^ 
the study by French social historian Beatrice Henin, on the effect of King Louis XIVs mi - ^^^^^ 
centuiy order to expand the city on people living in different quarters of Marseilles. inventoriee 
Henin developed a computer file of leasehold documents drawn up by notaries and P™*?^ jLj^yj^j^ ^^j. 
taken at the time of deaths. Towards the end of her research, she became interested in the 
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veterans since elementary school, are not likely to sift through thirty feet of paper records 
when new technologies permit intelligent pattern-matching, anomaly detection, and policy 
inference with massive volumes of electronic information.^^ In the future, the msoority of 
scholars can be expected to choose computers in combination with intelligent software as the 
research method of choice. The shift in soh'^larly research trends toward computer analysis 
of global libraries of digitized textbases merits the attention of archivists. In fact, it is likely 
that unless archival practices begin to address the changing research methods of patrons, the 
prestige, authority, and fimding of archives is destined to rapidly diminish.'*^ 

For three reasons - 1) expert systems promise to improve the management of archives, 
2) expert systems pose unique challenges for archival appraisal, and 3) expert systems 
inaugurate the use of new research methodologies - the archival profession needs a solid 
working knowledge of the technology, to identify its implications for the management of 
archives, and to transform archival practice as necessary. 



Conclusion 

During the next decade, the use of expert systems, especially in combination with other new 
technologies such as hypermedia and hypertext, supercomputers, digital imaging, and optical 
disks, will revolutionize the management of archives. The impact of the technology will be 
felt most strongly by corporate, university, and the spectrum of government archives. For 
example, Minnesota recently adopted a state-wide strategy for introducing expert systems 
technology into government operations. The strategy ir rludes a collaborative program with 
IBM, whereby the state, in conjunction with a local veudor, will develop 48 expert systems 
that perform basic government functions which IBM will market as customizable shells to 
the other forty-nine states. The state's governing leadership anticipates that the royalties 
received from the sale of these systems both will defray initial project costs and support 
additional expert systems development. To implement the strategy', Minnesota established 
a ten-person Knowledge Systems Center and instituted new state job classifications and 



of houses from different social classes. Her use of artificial intelligence to analyze pictures on the walls of 
rooms, largely with religious themes, led her to develop a new model which can be applied to the 
understanding of Protestant and Catholic families in England in the 17th century. 

In a number of schools in England instructors already are using artificial intelligence to teach students 
about historical research methods. For instance, AI technologies have been used to develop front*ends that 
allow middle to high-school age students to query census databases; in the design of computer simulations 
for events such as the Russian Revolution or the development of the European Economic Community to help 
students understand the meaning of historical context; or, with machine-readable trade directories from 
the 19th century to allow twelve year-old students to analyze changes in a community over time. See, 
Richard Ennals, Ar^i/icia/ Intelligence: Applications to Logical Reasoning and Historical Research (pp. 47-63; 
75-85, and 96-103), 

Avra Michelson (NARA/NSZ) and JefTRothenberg (The RAND Corporation) are preparing a paper on the 
policy implications for archives of the relationship between new technological capabilities and changes in 
scholarly research trends. 
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raining programs designed to produce the necessary para-technical staff/^ The Minnesota 
strategy suggests that state governments are prepared to integrate the technology into 
existing operations. 

Archivists need a deep imderstanding of exr.ert systems technology before attempting to 
determine its suitability to archival domains, or its likely affect on core mission functions. 
The requisite first step is to develop informed infirastructures within archival repositories. 
Educational programs are needed that encourage managers of existing systems, researchers 
fi*om AI laboratories, commercial contractors, and allied professionals to share their expertise 
with archival staff. Once staff members are sufficiently scho( led in the technology, they can 
begin to identify candidate applications for expert systems development. Research 
prototyping may be desirable for problem domains where feasibility is questionable but the 
potential payback high. This education, research, and analysis will estabhsh the foundation 
for developing a strategy to integrate the technology into existing operations and for 
monitoring the implications of existing systems for archives. 

However routine, archives can prepare for expert systems and other new technologies by first, 
developing an informed infrastructure, second, identifying suitable candidate applications, 
and tliird, performing research prototyping with high-rotum domains. In suitable domains, 
archivists should collaborate on research and development projects with other archives, 
government agencies, corporate organizations, or libraries to maximize benefits and defray 
costs. 

The information age is rapidly exploding into the information revdution. Like its nineteenth 
century predecessor, the information revolution will necessitate radical changes in a number 
of modem professions. Understanding the implications and planning for the use of new and 
emerging technologies will equip archivists to deal with an increasingly large segment of 
their mission. 



Information on the Minnesota initiative is from a presentation made by Mary Leonard^ Manager, Minnesota 
Knowledge Systems Center, at the panel ''State and Local Governments Use of Expert Systems/' AISIG '90 
(Washington, DC, May 10, 1990). 
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